Reconstruction of Ancestral Gene Order after Segmental Duplication and Gene Loss
Abstract
As gene order evolves through a variety of chromosomal rearrangements, conserved segments provide important insight into evolutionary relationships and functional roles of genes. However, gene loss within otherwise conserved segments, as typically occurs following large-scale genome duplication, has received limited algorithmic study. This has been a major impediment to comparative genomics in certain taxa, such as plants and fish. We propose a heuristic algorithm for the inference of ancestral gene order in a set of related genomes that have undergone large-scale duplication and gene loss. First, approximately conserved (i.e. homologous) segments are identified using pairwise local genome alignment. Second, homologous segments are iteratively clustered under the control of two parameters, (1) the minimal required number of shared genes between two clusters and (2) the maximal allowed number of rearrangement breakpoints along the lineage leading to each descendant segment. Finally, we compute an estimated ancestral gene order for each cluster that is optimal in some sense. We evaluate the performance of this algorithm on simulated data that models a genome evolving by large-scale duplication, duplicate gene loss, transposition, translocation, and inversion. The results suggest that long segments of ancestral gene order may be reconstructed following moderate levels of rearrangement with only minor loss of accuracy.
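The clustering step described above can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`shared_genes`, `breakpoints`, `cluster_segments`), the greedy single-linkage merge rule, and the unsigned-adjacency breakpoint count are all assumptions made for illustration. The sketch also assumes each gene occurs at most once per segment, and it covers neither the pairwise local alignment step nor the final ancestral order estimate.

```python
from itertools import combinations


def shared_genes(a, b):
    """Return the set of genes present in both segments."""
    return set(a) & set(b)


def breakpoints(a, b):
    """Count adjacencies of `a` that are absent from `b` (unsigned).

    Genes missing from either segment are dropped first, so gene loss
    alone does not create a breakpoint. Assumes no gene is repeated
    within a segment (an illustrative simplification).
    """
    common = shared_genes(a, b)
    ra = [g for g in a if g in common]
    rb = [g for g in b if g in common]
    adjacencies_b = {frozenset(pair) for pair in zip(rb, rb[1:])}
    return sum(1 for pair in zip(ra, ra[1:])
               if frozenset(pair) not in adjacencies_b)


def cluster_segments(segments, min_shared=3, max_breakpoints=1):
    """Greedily merge clusters of segments until no pair qualifies.

    Two clusters are merged when any pair of their segments shares at
    least `min_shared` genes and differs by at most `max_breakpoints`
    breakpoints (hypothetical single-linkage rule).
    """
    clusters = [[s] for s in segments]
    merged = True
    while merged:
        merged = False
        for i, j in combinations(range(len(clusters)), 2):
            if any(len(shared_genes(a, b)) >= min_shared
                   and breakpoints(a, b) <= max_breakpoints
                   for a in clusters[i] for b in clusters[j]):
                clusters[i].extend(clusters[j])
                del clusters[j]  # i < j, so index i stays valid
                merged = True
                break  # cluster list changed; restart the scan
    return clusters
```

Calling `cluster_segments` on a list of segments, each given as a list of gene identifiers, returns a list of clusters. Raising `min_shared` or lowering `max_breakpoints` makes merging more conservative, which mirrors the role the abstract assigns to the two control parameters.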
Similar resources
Gene Family: Structure, Organization and Evolution
Gene families are groups of homologous genes that share very similar sequences and may have identical functions. Members of gene families may be found in tandem repeats or interspersed throughout the genome. These sequences are copies of ancestral genes that have undergone changes. The multiple copies of each gene in a family were constructed based on gene duplicati...
Reconstruction of Ancestral Gene Order Following Large Scale Genome Duplication and Gene Loss
Gene order evolves through gross chromosomal rearrangements, small scale inversions and transpositions, gene duplication, and gene loss. Much research has been done on the calculation of edit distance and on sorting algorithms under a variety of rearrangement models in which the genome may be represented as conserved segments with permuted order and orientation. However, gene loss within otherw...
Sporadic Gene Loss After Duplication Is Associated with Functional Divergence of Sirtuin Deacetylases Among Candida Yeast Species
Gene duplication promotes the diversification of protein functions in several ways. Ancestral functions can be partitioned between the paralogs, or a new function can arise in one paralog. These processes are generally viewed as unidirectional. However, paralogous proteins often retain related functions and can substitute for one another. Moreover, in the event of gene loss, the remaining paral...
Supplemental Section S1 – Genome
Supplemental Section S1 – Genome Sequencing and Assembly; Supplemental Section S2 – Indel Assessment With the Neutral Indel Model; Supplemental Section S3 – Great Ape Divergence Estimate via WGS Read Mapping; Supplemental Section S4 – Short Read Sequencing; Suppl...
Inferring Ancestral Gene Orders for a Family of Tandemly Arrayed Genes
Tandemly arrayed genes (TAG) constitute a large fraction of most genomes and play important biological roles. They evolve through unequal recombination, which places duplicated genes next to the original ones (tandem duplications). Many algorithms have been proposed to infer a tandem duplication history for a TAG cluster. However, the presence of different transcriptional orientations in many c...